Unsupervised learning of rhetorical structure with un-topic models
Abstract
In this paper we investigate whether unsupervised models can be used to induce conventional aspects of rhetorical language in scientific writing. We rely on the intuition that the rhetorical language used in a document is general in nature and independent of the document’s topic. We describe a Bayesian latent-variable model that implements this intuition. In two empirical evaluations based on the task of argumentative zoning (AZ), we demonstrate that our generality hypothesis is crucial for distinguishing between rhetorical and topical language and that features provided by our unsupervised model trained on a large corpus can improve the performance of a supervised AZ classifier.
Similar resources
Jointly Modeling Topics and Intents with Global Order Structure
Modeling document structure is of great importance for discourse analysis and related applications. The goal of this research is to capture the document intent structure by modeling documents as a mixture of topic words and rhetorical words. While the topics are relatively unchanged through one document, the rhetorical functions of sentences usually change following certain orders in discourse....
The Effect of Cognitive Factors of Rhetorically Different Listening Tasks on L2 Listening Quality of Iranian Advanced EFL Learners
This study examined the effect of two different authentic topic-familiar rhetorical L2 listening tasks (expository and argumentative) differing in reasoning demand on the listening comprehension scores of a number of Iranian EFL advanced learners. Sixty homogeneous advanced learners were recruited based on their performance on an English Language Proficiency test (Fowler & Coe, 1976). Then they...
Traffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
Exploring Temporal Patterns in Emergency Department Triage Notes with Topic Models
Topic modeling is an unsupervised machine-learning task of discovering topics, the underlying thematic structure in a text corpus. Dynamic topic models are capable of analysing the time evolution of topics. This paper explores the application of dynamic topic models on emergency department triage notes to identify particular types of disease or injury events, and to detect the temporal nature o...
Unsupervised Declarative Knowledge Induction for Constraint-Based Learning of Information Structure in Scientific Documents
Inferring the information structure of scientific documents is useful for many NLP applications. Existing approaches to this task require substantial human effort. We propose a framework for constraint learning that reduces human involvement considerably. Our model uses topic models to identify latent topics and their key linguistic features in input documents, induces constraints from this inf...